Goto

Collaborating Authors

 Robots


Entangled robotic matter with cohesive motion

Robohub

Cornell engineers have developed a robotic collective that behaves less like a machine and more like a material that flows, reshapes and adapts to its environment without centralized control. The system, called the Cross-Link Collective, consists of dozens of small robots that have limited mobility individually, but together exhibit coordinated and sustained motion. The research, published May 20 in Science Robotics, demonstrates a robotic system that resembles soft matter, continuously deforming and reorganizing as it moves, driven by what researchers call mechanical intelligence. "Instead of relying on explicit computation and communication, the system shifts the intelligence into the shape of the robots and their physical interactions," said corresponding author Kirstin Petersen, associate professor of electrical and computer engineering and the Aref and Manon Lahham Faculty Fellow in the Cornell Duffield College of Engineering. "We're leveraging the contact dynamics to let useful behaviors emerge, so the system naturally settles into configurations that reduce internal stresses and improve motion."


This startup was supposed to revolutionize California's wine industry: 'It totally failed'

Los Angeles Times

Things to Do in L.A. Tap to enable a layout that focuses on the article. This startup was supposed to revolutionize California's wine industry: 'It totally failed' Nilay Patel, left, interviews Monarch Tractor Chief Executive Praveen Penmetsa during Vox Media's 2023 Code Conference in Dana Point, Calif., in 2023. That year, Monarch was on a Forbes list of startups most likely to reach a $1-billion valuation. This is read by an automated voice. Please report any issues or inconsistencies here .


Grasp2Grasp: Vision-Based Dexterous Grasp Translation via Schrödinger Bridges

Neural Information Processing Systems

We propose a new approach to vision-based dexterous grasp translation, which aims to transfer grasp intent across robotic hands with differing morphologies. Given a visual observation of a source hand grasping an object, our goal is to synthesize a functionally equivalent grasp for a target hand without requiring paired demonstrations or hand-specific simulations.


Seeing in the Dark: Benchmarking Egocentric 3D Vision with the Oxford Day-and-Night Dataset

Neural Information Processing Systems

We introduce Oxford Day-and-Night, a large-scale, egocentric dataset for novel view synthesis (NVS) and visual relocalisation under challenging lighting conditions. Existing datasets often lack crucial combinations of features such as ground-truth 3D geometry, wide-ranging lighting variation, and full 6DoF motion. Oxford Day-and-Night addresses these gaps by leveraging Meta ARIA glasses to capture egocentric video and applying multi-session SLAM to estimate camera poses, reconstruct 3D point clouds, and align sequences captured under varying lighting conditions, including both day and night. The dataset spans over 30 km of recorded trajectories and covers an area of 40,000 m2, offering a rich foundation for egocentric 3D vision research. It supports two core benchmarks, NVS and relocalisation, providing a unique platform for evaluating models in realistic and diverse environments.


Russian strikes kill nine in Ukraine and damage historic cathedral, officials say

BBC News

Nine people have been killed and several others injured in a wave of Russian strikes on Ukraine during which a major religious landmark in Kyiv caught fire, reports say. Four people were killed in attacks on Kyiv, while five rescue workers died trying to put out a fire caused by a Russian strike on the north-eastern city of Kharkiv, Ukrainian officials said. The 11th Century Dormition Cathedral was significantly damaged in what Ukrainian Prime Minister Yulia Svyrydenko called a brutal assault on our people and our heritage. Meanwhile, a Ukrainian drone attack in the Russian city of Tula, south of Moscow, killed three people and wounded three others, including a one-year-old, officials said. Drone and missile strikes set fire to buildings and cars and left more than 140,000 people in Ukraine's capital without electricity, Kyiv Mayor Vitali Klitschko said.


NFL-BA: Near-Field Light Bundle Adjustment for SLAM in Dynamic Lighting

Neural Information Processing Systems

Simultaneous distant terranean illumination; robotics, Localization and howe search v and er, man & Mapping rescue y real-w in (SLAM) collapsed orld scenarios, systems environments, such typically as endoscop require assume agents y static,, subto such operate cases, with dynamic a co-located near-field light lighting and camera introduces in the strong, absence vie of w-dependent external lighting.


SegGraph: Leveraging Graphs of SAMSegments for Few-Shot 3DPart Segmentation

Neural Information Processing Systems

This work presents a novel framework for few-shot 3D part segmentation. Recent advances have demonstrated the significant potential of 2D foundation models for low-shot 3D part segmentation. However, it is still an open problem that how to effectively aggregate 2D knowledge from foundation models to 3D. Existing methods either ignore geometric structures for 3D feature learning or neglects the high-quality grouping clues from SAM, leading to under-segmentation and inconsistent part labels. We devise a novel SAM segment graph-based propagation method, named SegGraph, to explicitly learn geometric features encoded within SAM's segmentation masks.


Domain Adaptation for and Real Policy Co Training

Neural Information Processing Systems

Behavior cloning has shown promise for robot manipulation, but real-world demonstrations are costly to acquire at scale. While simulated data offers a scalable alternative, particularly with advances in automated demonstration generation, transferring policies to the real world is hampered by various simulation and real domain gaps. In this work, we propose a unified sim-and-real co-training framework for learning generalizable manipulation policies that primarily leverages simulation and only requires a few real-world demonstrations. Central to our approach is learning a domain-invariant, task-relevant feature space. Our key insight is that aligning the joint distributions of observations and their corresponding actions across domains provides a richer signal than aligning observations (marginals) alone. We achieve this by embedding an Optimal Transport (OT)-inspired loss within the co-training framework, and extend this to an Unbalanced OT framework to handle the imbalance between abundant simulation data and limited real-world examples. We validate our method on challenging manipulation tasks, showing it can leverage abundant simulation data to achieve up to a 30% improvement in the real-world success rate and even generalize to scenarios seen only in simulation.


COS3D: Collaborative Open-Vocabulary 3DSegmentation

Neural Information Processing Systems

Open-vocabulary 3D segmentation is a fundamental yet challenging task, requiring a mutual understanding of both segmentation and language. However, existing Gaussian-splatting-based methods rely either on a single 3D language field, leading to inferior segmentation, or on pre-computed class-agnostic segmentations, suffering from error accumulation. To address these limitations, we present COS3D, a new collaborative prompt-segmentation framework that contributes to effectively integrating complementary language and segmentation cues throughout its entire pipeline. We first introduce the new concept of collaborative field, comprising an instance field and a language field, as the cornerstone for collaboration. During training, to effectively construct the collaborative field, our key idea is to capture the intrinsic relationship between the instance field and language field, through a novel instance-to-language feature mapping and designing an efficient two-stage training strategy. During inference, to bridge distinct characteristics of the two fields, we further design an adaptive language-to-instance prompt refinement, promoting high-quality prompt-segmentation inference. Extensive experiments not only demonstrate COS3D's leading performance over existing methods on two widely-used benchmarks but also show its high potential to various applications, i.e., novel image-based 3D segmentation, hierarchical segmentation, and robotics.


Watch: Protesters clash with police ahead of G7 summit in Geneva

BBC News

Protesters clashed with police forces during a demonstration against the upcoming G7 summit in Geneva. Tear gas and a water cannon were deployed to disperse the large crowd after protesters smashed windows and set a car on fire. What needs to be understood is the message, the basic message regarding all these countries that oppress us through money and power, said one protester who was disappointed to see the protest turn violent. The G7 summit starts on 15 June in Évian-les-Bains and will bring together the leaders of Britain, France, Canada, Germany, Italy, Japan, the United States and the European Union. Pope Leo XIV says Barcelona's iconic Sagrada Família is a masterpiece of stones, colours and light during his visit to Spain.